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ABSTRACT 

We compare multi-wavelength star formation rate (SFR) indicators out to z 3 in the GOODS-South 
field. Our analysis uniquely combines J7-to-8/zm photometry from FIREWORKS, MIPS 24 /im and 
FAGS 70, 100, and 160 /xm photometry from the PEP survey, and Ha spectroscopy from the SINS 
survey. We describe a set of conversions that lead to a continuity across SFR indicators. A luminosity- 
independent conversion from 24 /im to total infrared luminosity yields estimates of Liji that are in 
the median consistent with the Lm derived from PAGS photometry, albeit with significant scatter. 
Dust correction methods perform well at low to intermediate levels of star formation. They fail to 
recover the total amount of star formation in systems with large SFRjji/ SFRuv ratios, typically 
occuring at the highest SFRs [SFRuv+m ^ 100 MQ/yr) and redshifts (z > 2.5) probed. Finally, we 
confirm that iJa-based SFRs at 1.5 < z < 2.6 are consistent with SFRsed and SFRuv+iR provided 
extra attenuation towards HII regions is taken into account {Av.neb = ^y.contmuitm/0. 44). With the 
cross-calibrated SFR indicators in hand, we perform a consistency check on the star formation histories 
inferred from SED modeling. We compare the observed SFR-M relations and mass functions at a range 
of redshifts to equivalents that are computed by evolving lower redshift galaxies backwards in time. 
We find evidence for underestimated stellar ages when no stringent constraints on formation epoch 
are applied in SED modeling. We demonstrate how resolved SED modeling, or alternatively deep 
UV data, may help to overcome this bias. The age bias is most severe for galaxies with young stellar 
populations, and reduces towards older systems. Finally, our analysis suggests that SFHs typically 
vary on timescales that are long (at least several 100 Myr) compared to the galaxies' dynamical time. 
Subject headings: galaxies: high-redshift - galaxies: stellar content 



1. INTRODUCTION 

Determining when the stars were formed is one of the 
main goals in galaxy evolution. Two widely adopted ap- 
proaches to this question are to measure the assembled 
stellar mass at various lookback times (see, for the local 
universe, Gole et al. 2001; Bell et al. 2003, at interme- 
diate redshifts Bundy et al. 2006; Borch et al. 2006; 
Pozzetti et al. 2007; Vergani et al. 2008, and out to 
z - 5 Dickinson et al. 2003; Fontana et al. 2003, 2004, 
2006; Drory et al. 2004, 2005; Eisner et al. 2008; Perez- 
Gonzalez et al. 2008; Marchesini et al. 2009), or to 
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quantify the rate of on-going star formation over cosmic 
time (Madau et al. 1996; Lilly et al. 1996; Steidel et al. 
1999; Giavahsco et al. 2004; Schiminovich et al. 2005; 
Bouwens et al. 2007, 2010). While the latter should in 
principle integrate up to the former, modulo stellar mass 
loss, it is currently heavily debated whether or not the 
data, or rather the physical quantities estimated from 
them, satisfy this continuity equation (Hopkins & Bea- 
com 2006; Reddy & Steidel 2009). 

The relation between the instantaneous star formation 
rate (SFR) and stellar mass of individual galaxies sheds 
light on how (in bursts or gradually) and where (in galax- 
ies of what mass) the star-forming activity took place 
(Noeske et al. 2007; Elbaz et al. 2007; Daddi et al. 
2007a). At a given mass, observations show a rapidly in- 
creasing SFR with redshift (e.g., Lilly et al. 1996; Gowie 
et al. 1996; Bell et al. 2005), whereas the models pre- 
dict an increase in growth rate that is significantly slower 
(Dave 2008; Damen et al. 2009). 

Both issues urge us to critically investigate the assump- 
tions made in translating fluxes and colors to an estimate 
of the SFR. Especially the presence of dust greatly im- 
pacts the interpretation of multi-wavelength data. The 
SFR indicators considered in this paper come in two fia- 
vors: either the rate of unobscured and obscured star 
formation is summed (SFRuv+iR in this paper), or the 
rate of unobscured star formation is scaled up by a dust 
correction factor (SFRseb, SFRiJv,corr and SFRhol in 
this paper). The former requires knowledge of the total 
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infrared (IR) luminosity {Lifi) emitted in the rest-frame 
8-1000 fim range by dust that was heated by young stars, 
often extrapolated from the observed 24 /im photometry. 
The latter requires knowledge of how the observed colors 
break down in intrinsic colors of the stellar population 
and dust reddening, which both are model dependent. 

Here, we exploit the wealth of multi-wavelength 
data in the GOODS-South field, including 
HST/ACS, HST/WFC3, VLT/ISAAC, Spitzer/IRAC, 
Spitzer/MIPS 24 ^m, and Herschel/PACS 70 - 160 /im 
imaging, as well as VLT/SINFONI Ha spectroscopy to 
calibrate SFR indicators of galaxies out to z ~ 3. This 
work lays the basis for Wuyts et al. (2011b), where we 
analyze galaxy morphologies as a function of position 
in the SFR-Mass diagram. In order to carry out such 
a systematic study of the entire galaxy population 
over a large range of masses and SFRs, establishing 
a continuity across SFR indicators is essential. This 
is obviously of more general use as well. Doing so, 
our analysis places constraints on the distribution of 
dust within these galaxies, and on their star formation 
histories (SFHs). In addition, we exploit the inferred 
SFRs and SFHs to test whether galaxy populations at 
different lookback times satisfy a continuity equation. 

We present a brief overview of the data used in Section 
[21 Next, we make the case for a luminosity- independent 
conversion to L/a, in Section |31 SFRs derived by sum- 
ming a UV and IR contribution are contrasted to SED 
modeled SFRs and dust-corrected UV SFRs in Section 
m and to i?a-derived SFRs in Section [5l Section |6] dis- 
cusses the consistency of inferred SFHs, and introduces 
resolved SED modeling as a means to better characterize 
the age of the bulk of the stars in a galaxy. Finally, we 
summarize our results in Section [T] 

Throughout this paper, we quote magnitudes in the 
AB system, and adopt the following cosmological param- 
eters: (17m, r^A, /i) = (0.3,0.7,0.7). 

2. SAMPLE AND OBSERVATIONS 

2.1. FIREWORKS Ks-selected Catalog 

The backbone of this work is the i^^^-selected FIRE- 
WORKS catalog sampling the U-to-8fim regime with 16 
passbands {Ksjim — 24.3, 5a; Wuyts et al. 2008, here- 
after W08). Initially, we focus on the 1333 galaxies in 
the redshift range < z < 3 that were significantly de- 
tected (> 3(t) at 24fim. Redshifts are spectroscopic for 
69% of the sample below z < 1.5, and for 27% above 
z > 1.5. For the remaining sources, we derived high- 
quality photometric redshifts using EAZY (Brammer et 
al. 2008). The median and normalized median absolute 
deviation in Az/(1 + z) are (-0.002; 0.025) at z < 1.5 
and (-0.003; 0.044) at z > 1.5. 

2.2. PEP GOODS-S Survey 

The GOODS-South field has been observed with the 
Photodetector Array Camera and Spectrometer (PACS, 
Poglitsch et al. 2010) onboard the Herschel Space Obser- 
vatory as part of the PACS Extragalactic Survey (PEP, 
Lutz et al. 2011). With 5cr depths for prior extraction 
of 1.8 mJy, 1.9 mJy and 3.3 mJy at 70 /im, 100 /im, 
and 160 /im respectively, it is the only PEP field to be 
imaged at 70 /im, and a factor 2.5 - 4.5 deeper than the 
other PEP blank fields at 100 /im and 160 /im. These 



observed bands sample rest-frame wavelengths close to 
the peak of IR emission of galaxies in our redshift range 
of interest. For an in depth description of the observa- 
tions, data reduction and catalog building, we defer to 
Bcrta ct al. (2010) and Lutz et al. (2011). Briefiy, the 
PACS photometry used in this paper was obtained by 
PSF-fitting using the positions of MIPS 24 /xm detected 
sources from Magnelli et al. (2009) as prior. Adopting 
this positional prior reduces the effects of confusion, and 
is justified by the relative depths of the MIPS and PACS 
imaging. 

2.3. SINS GOODS-S Sample 

The SINS survey (e.g., Forster Schreiber et al. 2006, 
2009; Genzel et al. 2006, 2008) is a VLT/SINFONI GTO 
program targeting 80 high-redshift star-forming galaxies, 
providing integral-field emission line maps and integrated 
fluxes without slit losses. We focus on the subset of 25 
SINS galaxies located in the GOODS-South field with 
Ha observations spanning a redshift range 1.5 < z < 2.6, 
and a range in stellar mass from 5.1 x IO^Mq to 1.4 x 
10 ^^M0. Their location in the SFR versus mass diagram 
is representative for that of the underlying population of 
star- forming galaxies at similar redshift. 

3. SFRS FROM ADDING UV AND IR EMISSION 
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Fig. 1. — Star formation rates of galaxies at < z < 3 based 
on UV plus 24 emission versus UV plus PACS emission. The 
IR part of the SFR is proportional to the total IR luminosity Ljji, 
which was derived monochromatically using conversions and tem- 
plate sets from CEOl, DH02, W08, and Nil. Adopting the locally 
calibrated luminosity-dependent recipes from CEOl and DH02 to 
convert observed 24 /im fluxes to Ljr leads to overpredicted SFRs 
at the high-redshift and high-SFR end. For Nil, these cases are 
limited to redshifts above z > 2.5, where the 24 /im filter enters 
Arest < 6 /im, and the CEOl/Nll templates lack a physically re- 
alistic calibration (open gray circles). 

In this Section, we analyze SFR indicators based on 
adding the unobscured and obscured star formation, 
traced by the UV and IR emission respectively. With 
the advent of PACS/Herschel, the emission at or near 
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Fig. 2. — Top: Spectral energy distribution template by Wuyts et 
al. (2008), with the observed photometry of PACS-detected galax- 
ies at 2 < 3 in the GOODS-South field ovcrplotted at the respective 
rest-frame wavelengths. For every galaxy, a uniform scaling is ap- 
plied to its 24—160 /im photometry so as to normalize the observed 
24 fira flux (gray) to the template {black) convolved by the MIPS 
passband. Large black symbols indicate the binned median and 
standard deviation of the scaled PACS photometry, matching the 
template, albeit with a significant (0.25 dex) scatter. Small black 
circles mark sources with an X-ray luminosity above 10^^ erg . 
Bottom: Residual from the Wuyts et al. (2008) template as func- 
tion of inferred total IR luminosity. Large empty boxes and filled 
circles indicate the binned median, with and without galaxies at 
z < 0.2 respectively. Excluding the most nearby galaxies (small 
empty circles), only a marginal trend with IR luminosity remains. 



the IR peak can now be probed directly out to high red- 
shift to unprecedented depths (see, e.g., Nordon et al. 
2010; Elbaz et al. 2010). Combining PACS and SPIRE 
data, Elbaz et al. (2010) found that the total IR lumi- 
nosity (i/_R = 1/(8 — 1000 /im)) can be derived reliably 
from a single PACS band. We will therefore restrict our- 
selves to such monochromatic conversions to Lm in this 
paper. 



The depth of PACS imaging in the GOODS-South field 
(2.0 mJy at 160 fim, 3a) is among the deepest of all IR 
lookback surveys. In most other fields, the derivation of 
Lju or SFRjjv+iR from PACS photometry has to rely 
on stacking analyses to reach the same depths. In order 
to preserve information on an individual object level, IR 
luminosities for all but the brightest galaxies must then 
be derived from MIPS 24 /im. Here, we exploit the deep 
mid- and far-IR data in GOODS-South to test for inter- 
nal consistency between them. 

We consider four conversion recipes. The conversions 
by Chary & Elbaz (2001, hereafter CEOl) and Dale & 
Helou (2002, hereafter DH02) rely on locally calibrated 
template libraries, where the template adopted from 
the library depends on the galaxy's luminosity (gener- 
ally yielding larger conversion factors for more luminous 
galaxies at a given redshift). Studying PACS observa- 
tions in GOODS-North, Nordon et al. (2010) argued 
for enhanced emission of polycyclic aromatic hydrocar- 
bons (PAHs) in z ~ 2 galaxies relative to local galaxy 
SEDs of the same Lju- Accounting for this, Nordon et 
al. (2011, hereafter Nil) formulate a recalibration of 
the CEOl library for the 1.5 < z < 2.5 regime (below 
z < 1.5, the original CEOl conversion is used). Specif- 
ically, these authors find little to no luminosity depen- 
dence of the rest-frame 20-60 /im SED shape for the 
Liu ^ 10^^ Lq regime probed. In the mid-infrared, their 
conversion has a luminosity dependence, but such that 
the ratio vL^irest — Sjim)/ Ljji at a given redshift and 
luminosity is larger than that from CEOl. Finally, Wuyts 
et al. (2008) introduced a luminosity-independent con- 
version based on a single template, which was motivated 
by MIPS 24 /xm, 70 /xm, and 160 fj,m stacking results by 
Papovich et al. (2007). This template was constructed 
by averaging the logarithm of DH02 templates with the 
parametrization a of the intensity of the interstellar ra- 
diation field in the range 1 < a < 2.5 in steps of 0.0625. 

In terms of local analogs, its mid- to far-infrared SED 
shape is more reminiscent of M82 than of Arp220. 

In Figure [1] we present a comparison of SFRijv+24fj.m 
to SFRuv+PACS for PACS-detected galaxies out to z = 
3, for each of the above Lm conversion recipes. In each 
case, the total SFR is derived following Kennicutt (1998): 

SFRuv+iR [Mo yr-i] = 1.09x10-1° (L/fl,+3.3L28Oo)/i0 

(1) 

where the relative scaling of the UV and IR contri- 
bution is based on calibrations in the local universe, 
and the overall scaling factor assumes a Chabrier (2003) 
IMF. The rest- frame luminosity L2800 = i^L^(2800A) 
was computed with EAZY from the best-fitting SED, it- 
self a superposition of the 6 EAZY principle-component 
templates. Only at the lowest redshifts (z < 0.3), rest- 
frame 28OOA falls blueward of the available photome- 
try. For SFRuv+PACS, we use the longest wavelength 
PACS band that had a significant (> ia) detection. The 
color-coding indicates the redshift of the galaxies. It is 
a well-known fact that galaxies that are highly actively 
star-forming are more common at larger lookback times. 
The lower end of the SFR distribution at a given redshift 

The template, as well as a table with conversion factors from 
MIPS 24 A tm, and PACS 70 Atm, 100 u rn and 160 /im to Lm is 
released at |http://www.mpe.mpg.de/~| swuyts/Lir .template. html 
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is simply determined by the flux limit of our sample. 

At z < 2 and SFR < 100 Me/yr, we find little differ- 
ence between the PACS and 24 /im-based estimates, ir- 
respective of the conversion recipe used. However, above 
z > 2 and for SFRs above 100 Afe/yr, the 24 ^m- 
based estimates using the locally calibrated CEOl and 
DH02 conversions are clearly overestimated with respect 
to those based on PACS photometry. At 1.5 < z < 2.5, 
no such systematic bias is present in the case of the 
W08 and Nil conversions. These results imply that, 
rather than having mid- to far-IR SEDs as local ULIRGs, 
ULIRGs at z ^ 2 are better characterized as scaled-up 
versions of local LIRGs, which are colder than nearby 
ULIRGs. This finding is in agreement with recent results 
by Nordon et al. (2010), Elbaz et al. (2010), Muzzin et 
ai. (2010), and Symeonidis et al. (2011). 

We point out that any variation in vL^l2AiJLm) / Lm, 
where Li,(24/xm) refers to the specific luminosity probed 
by the observed 24 fxm band, translates superlinearly 
to variations in Lifi{2A fim)/ Luj (160 fim) when ap- 
plying a luminosity-dependent conversion. This leads 
to a somewhat increased scatter around unity in the 
Ljii{24 /xm)/L/ij(160 /im) ratio of 1.5 < z < 2.5 galax- 
ies for Nil (- 0.4 dex) compared to W08 (-- 0.3 dex), 
since the scatter in vLi,{2A^m) / Lm at a given lumi- 
nosity is large compared to any systematic variation of 
vL^{24:fim) /Lju over the luminosity range probed. More 
importantly, the luminosity-independent conversion by 
W08 has the advantage of being applicable also above 
z > 2.5, where Nil shows more systematic outliers. This 
is at least in part due to the fact that the 24 /im band 
starts entering the Xrest ^ 6 fim regime where the CEOl 
templates lack a physically realistic calibration. 

In order to address the uncertainties involved in con- 
verting 24 /im to Lm in more detail, we present the 
W08 template in Figure [2] Overplotted, we show the IR 
photometry of < z < 3 galaxies that are individually 
detected (> 3cr) in at least one PACS band. Their 24 
l^m flux is normalized to match the template flux within 
the passband at the respective redshift, but their 24 /im 
- PACS colors are left as observed. Black filled circles 
and error bars mark the median and central 68th per- 
centiles for PACS-detected galaxies at 0.2 < z < 3, 
respectively. Gray empty circles and error bars illus- 
trate small changes when including the lowest redshift 
{z < 0.2) sources. Although a significant scatter (0.25 
dex in Alogvf^) is notable, the observations are in the 
median consistent with the W08 template. Moreover, 
Figure [2}d demonstrates that the residuals from the tem- 
plate do not strongly correlate with the Lm derived from 
the respective PACS band, especially when excluding lo- 
cal (z < 0.2) galaxies, that dominate the lowest lumi- 
nosities (10^ Lq < LjR < 10^° Lq). In other words, 
we observe a spread in 24 /im - PACS colors, even at a 
given redshift, but this spread does not seem to correlate 
strongly with galaxy luminosity. As a consequence of the 
observed fiux limits, different luminosity bins in Figure 
will naturally be populated by galaxies of a different 
redshift distribution. This could potentially mask some 
luminosity dependence present within a narrow redshift 
slice. Including also observations at 16 /im. Nil investi- 
gate this by probing the rest-frame 8 /im luminosity in 
bins of both Liji and redshift, finding a dependence on 
both albeit with significant scatter. While not captur- 



ing these physics, the W08 SED can be considered as an 
effective template offering a simple and robust conver- 
sion from observed 24 /im photometry to i/^j. The 7.7 
/tm PAH strength may increase with redshift relative to 
Lifi (see Nil), but for the purpose of converting MIPS 
24 /im to L/_R, it is sufficient that its strength in the 
template is appropriate for z ~ 2, where this feature is 
sampled by the 24 /im passband. Given that the scatter 
in vL^{24,^m) / Lju is large compared to any variation 
with luminosity, not accounting for a luminosity depen- 
dence has the compensating advantage that deviations in 
i/Li,(24/im) at a given luminosity do not translate non- 
linearly to L/ij(24/im). Another parameter potentially 
contributing to the scatter observed in Figure [^b, is the 
dust temperature. More specifically, variations in the 
dust temperature from galaxy to galaxy would express 
themselves as deviations from a unique SED shape, and 
hence a source of scatter in Figure ^p. Magdis et al. 
(2010) recently reported a wide range of dust tempera- 
tures in high-redshift ULIRGs. For an in depth discus- 
sion of the relation between dust temperatures and other 
characteristics of high-redshift galaxies (such as SFR and 
mass), we defer the reader to Magnelli et al. (in prep). 
In the Appendix, we briefly discuss possible contribu- 
tions by Active Galactic Nuclei (AGN), but argue that 
they do not affect the above results for the ensemble of 
PACS-detected galaxies. For a first exploration of AGN 
in the context of PACS observations, we refer the reader 
to Shao et al. (2010). 

In the above analysis, we focussed on PACS detections 
only. While half of the 24 /im sources in GOODS-South 
are not detected at any PACS wavelength, the PACS 
sample is matched quite well to the 24 /im depth of 
other, shallower fields. For example, 90% of the galax- 
ies brighter than the 24 /im detection threshold of COS- 
MOS are PACS-detected in GOODS-South. This implies 
that the conversion described in this section can safely 
be applied to translate the COSMOS 24 /im photome- 
try to Lju. We briefly explored how the median SED 
shape changes when going beyond the PACS detection 
threshold by means of stacking. We computed the me- 
dian stacked fluxes in bins of rest-frame wavelength now 
including all sources detected at 24 /tm. Stacking was 
performed after scaling the PACS postage stamps by the 
same factor used to normalize the 24 /im flux of the corre- 
sponding object to the W08 template. Postage stamps of 
PACS-undetected sources were cut out of residual maps 
from which all individually detected sources were sub- 
tracted. We find that the blue side of the far-IR bump 
(20 /im < \rest < 60 /im) is somewhat more depressed, 
by ~ 0.15 dex with respect to the W08 template to which 
the 24 /im flux was normalized. However, the amplitude 
of the peak in IR emission, where the bulk of the energy 
emitted at 8 - 1000 /im comes from, is well matched by 
the template. The 8 - 1000 /im integral over the DH02 
or CEOl template that best fits the shape of the median 
stacked SED deviates by merely a percent from that of 
the W08 template, making the latter an acceptable basis 
for deriving L/^ for our full 24 /im-detected sample. 

We conclude that a continuity between 24 /im and far- 
infrared based SFR indicators can be obtained out to 
z — 3, albeit with significant scatter. The good median 
performance of the W08 conversion in the light of new 
Hcrschel observations is encouraging for recent studies 
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in which it was applied (Franx et al. 2008; Wuyts et al. 
2009b; Damen et al. 2009; Kajisawa et al. 2010). 

4. SFRS FROM SED MODELING AND DUST-CORRECTED 
UV EMISSION 

In GOODS-South, not only the PACS but also the 
MIPS depth (0.02 mJy at 24 ^m, 5cr) is one of the deep- 
est available. Consequently, in studies that exploit larger 
areas to reduce cosmic variance effects, one will be lim- 
ited by the shallower MIPS imaging of other fields. Other 
SFR tracers than IR-based methods are therefore indis- 
pensable. 

4.1. Cross-calibration to SFRuv+iR 

Modeling of broad-band SEDs with stellar population 
synthesis models is one of the most commonly used ap- 
proaches, as it is a more generally accessible means to 
derive SFRs (among other stellar population properties) 
for galaxies over a wide range of redshifts and types. It 
can also be extended to fainter galaxies with lower SFRs. 

We investigate the performance of this technique, 
which is essentially a dust correction method, by com- 
paring it to SFRuv+iR for galaxies with a PACS and/or 
MIPS detection. The Lm that enters the computa- 
tion of SFRuv+iR was calculated using the W08 tem- 
plate and the longest wavelength IR band that had a 
significant detection. We adopt default assumptions re- 
garding SED modeling. Briefly, exponentially declining 
SFHs, also known as r models, were constructed from the 
Bruzual & Chariot (2003, hereafter BC03) stellar pop- 
ulation synthesis models, and fitted to the C/-to-8 /im 
FIREWORKS SEDs using the fitting code FAST (Kriek 
et al. 2009). We are aware that increasing star formation 
histories have recently been proposed as more appropri- 
ate for galaxy evolution at early times (z > 3, Renzini et 
al. 2009; Maraston et al. 2010; Papovi ch et al. 2011), 
and explore this option further in Section r6.2.2l For now, 
we restrict the analysis of our < z < 3 sample to de- 
fault r models whose wide use in the literature makes it 
worthwhile testing their performance in light of the new 
IR constraints. As throughout the paper, we adopt a 
Chabrier (2003) IMF, and we assume solar metallicity. 
We required the time since the onset of star formation to 
be at least 50 Myr, and not exceeding the age of the uni- 
verse at the epoch of observation. We experimented with 
minimum e-folding times varying between 10 Myr and 
10 Gyr. The fitting procedure minimizes the statistic 
taking discrete steps of 0.1 dex in age and r within the 
allowed range. We allow the galaxies to be attenuated 
within a range of visual extinctions < Ay < 4, in steps 
of 0.1 mag, assuming a uniform foreground screen geom- 
etry, and with the reddening following the Calzetti et al. 
(2000) law. We refer to the instantaneous SFR of the 
best-fit model as SFRsed- 

We find the best correspondence to SFRjjv+ir when 
forcing the e-folding time to be larger than log Tmin = 
8.5. The plotted points in Figure [3] correspond to this 
set of boundary conditions. The majority of galaxies, 
at low to intermediate (SFRuv+ir. ^ 50 MQ/yr) SFRs 
line up along a ridge line which coincides with the one- 
to-one relation. A tail toward lower SED-modeled SFRs 
is present, and contains more than 50% of the galaxies 
at the high SFR (and high redshift) end. When allowing 
shorter e-folding times, we find these are often preferred 
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Fig. 3. — The SFRs of IR-detected galaxies at < z < 3 
based on adding the unobscured (UV) and re-emitted (IR) ra- 
diation of newly formed stars versus SFRs from stellar popula- 
tion modeling of observed optical-to-mid-IR SEDs. Here, SFRggn 
was derived using BC03 models with exponentially declining SFHs 
(with log(rmi„) = 8.5). For modest levels of star formation 
{SFR < 50 Mq yr~^), both SFR indicators agree well in the 
median (large black circles, with error bars indicating the central 
68th percentile). At higher SFRs, which become increasingly more 
common toward higher redshift, the SED-modeled SFR systemat- 
ically underpredicts that inferred from UV -I- IR. Red and yellow 
curves illustrate the median relation when adopting less stringent 
constraints (log(Tmi„) = 7 and 7.5 respectively) on the minimum 
e-folding time. The orange line marks the median relation when 
allowing very long (log(rmi„) = 10) e-folding times only. 



in a least-squares sense, but leave a systematic offset 
in SFR in comparison to measurements of SFRjjv+ir- 
The red and yellow line in Figure [3] indicate the median 
trend for logr,„i„ = 7.0 and 7.5 respectively. It thus 
seems that e-folding timescales of at least several 100 
Myr give results that are more consistent with UV-l-IR 
based estimates than more rapidly declining histories. ^'^ 
In Section 16.21 we will describe independent evidence in 
this direction, based on the implausibly young ages in- 
ferred from fits with small t values. Changing log(T,„i„) 
from 8.5 to 9 leads to a similar performance in the com- 
parison to SFRuv+iR- However, when allowing very 
long e-folding times only (log(r„„) = 10), SFRsed 
becomes an overestimate at intermediate star formation 
levels (see orange curve in Figure Overall, we recom- 
mend log(rmm) = 8.5, for its effectiveness at reproducing 
the low to intermediate SFRs while not overly restricting 
the allowed range of SFHs. 

To contrast this timescale of ^ 300 Myr for varia- 
tions in star formation activity with another characteris- 

If we were to average the best-fit SFH over the last 100 Myr 
instead of adopting the instantaneous SFR, which for t models 

implies an increase by a factor ^qq]^^^ (^^ moAf yi- _ ^ the depen- 
dence on Tmi„ reduces, and SFRs derived with short T^in would 
agree better with SFRuv+iR- However, given t he im plausibly 
young ages associated with those fits (see Section 16.21 1 longer e- 
folding times still seem preferred. 
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Fig. 4. — Ratio between SFRggu and SFR(7y_)./jj as function of 
relative fraction of radiation from young stars that is re-processed 
by dust versus escaping witliout attenuation. The underprediction 
of the total amount of star formation by SFRsed (i-e., a dust 
correction method) is most severe for galaxies that have the largest 
SFRiji/ SFRuY ratio. These conditions are more prevalent in 
vigorously star-forming systems, as found more commonly at high 
redshift. Saturation of reddening as an extinction tracer due to a 
patchy dust distribution is responsible for this effect. 



tic timescale of the galaxies in our sample, we computed 
the dynamical time tdyn ~ \J 2c; m °^ galax- 
ies that are covered by the HST/WFC3 Early Release 
Science program. Here, we measured the effective ra- 
dius R by fitting Sersic models to the /f-band surface 
brightness distribution using GALFIT v3.0 (Peng et al. 
2010). We find typical r/tdyn ratios of order ^ 50, with 
large variations from 10 to 100 and up. This implies 
that most star formation happened in a relatively sta- 
ble mode over long timescales, as opposed to a single 
short burst taking place on a dynamical time. Similar 
conclusions were drawn by Genzel et al. (2010), who 
pushed the study of the Schmidt-Kennicutt law (Schmidt 
1959; Kennicutt 1998) and the Elmegreen-Silk relation- 
ship (Elmegreen 1997; Silk 1997) for normal star-forming 
galaxies to z ^ 1 — 3. 

Despite the good overall correspondence between the 
SFR indicators, the tail of systematic underestimates by 
SFRgED: reaching an order of magnitude at the largest 
SFRijv+iR ~ 300 M^lyr (predominantly galaxies at 
z > 2.5) remains worrisome. Such a trend at the high- 
SFR end was previously signaled by Santini et al. (2009) 
on the basis of pre-Herschel data sets. Figure |3] sheds 
light on its origin. The deviation from a one-to-one rela- 
tion between SFRjjv+iR and SFRsed is a clear func- 
tion of the relative ratio of emission from young stars 
that is re-processed by dust versus escaping unhindered 
as UV light. The diagram illustrates that particularly 
when the unattenuatcd UV emission reveals only a few 
percent of the total star formation (i.e., when the dust 
correction factor is very large), a simplistic dust correc- 
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Fig. 5. — The SFRs of IR-detected galaxies at lA < z < 2.5 
based on adding the unobscured (UV) and re-emitted (IR) ra- 
diation of newly formed stars versus the Daddi et al. (2007) 
dust correction method which relies solely on the observed B — z 
color. Large filled and open symbols indicate the median in bins 
of SFRuY^m and SFRuY,corr respectively. Whereas the two 
methods yield consistent results at modest SFR (< 50 Mq yr~^) 
levels, the dust correction method fails to recover the total amount 
of star formation at the highest SFRj/y^./^ bin. Note that in- 
ferred systematic offsets are dependent on the K-band limit of the 
sample. 



tion method based on a uniform foreground screen does 
not account properly for optical depth effects. The rea- 
son is that information on the dust correction factor is 
not directly embedded in the galaxy's SED. It is the red- 
dening of the spectral slope that is directly traced, but 
only translates linearly to extinction when the obscuring 
material has the configuration of a foreground screen. 
In case of patchy dust obscuration, or dust that is mixed 
with the emitting sources, reddening saturates as a tracer 
of extinction. In the case of selective extinction towards 
young star-forming regions, the resulting underestimate 
of the SFR can be even more pronounced (Poggianti et al. 
2001; Wuyts et al. 2009a). The same effect is seen in lo- 
cal ULIRGs (Goldader et al. 2002) and starburst galaxies 
(e.g., Forster Schreiber 2001 and references therein) that 
lie above the well-known IRX-/3 relation (Meurer et al. 
1999), and in hydrodynamic simulations of high- redshift 
(merging) galaxies (Wuyts et al. 2009a, Fig. 10). Our 
conclusion confirms the recent results by Reddy et al. 
(2010), who find that galaxies with large IR luminosities 
{L > 10^^ Lq) at z ~ 2 also lie above the IRX-/3 relation, 
unlike more modestly star-forming galaxies at the same 
epoch. 

4.2. Single-color SFRs for Star-forming z ^ 2 Galaxies 

Daddi et al. (2004; 2007a) introduced an observed 
color criterion to dust-correct UV SFRs of BzK galaxies 
at 1.4 < z < 2.5. Since it is based on observed-frame, 
rather than rest-frame photometry, it has the limitation 



7 



of only being applicable to the redshift range for which 
it was designed. On the other hand, it has the advan- 
tage of being more easily accessible, as it only requires 
B and z photometry (and K in order to select the high- 
redshift galaxies in the first place). Furthermore, it has 
been argued that the use of such a limited wavelength 
range avoids dilution of the information on the rate of 
star formation by degeneracies with other stellar popu- 
lation parameters. It is therefore useful to address the 
performance of the Daddi et al. (2007a) criterion in light 
of the SFRjjv+iR calibrations (Section [S]), that are now 
motivated by the new Herschel/PEP photometry. 

Figure [5] presents a comparison of SFRjjv+iR to 
the single-color dust correction method by Daddi et al. 
(2007). We plot all galaxies with an IR detection (i.e., for 
which SFRjjv+iR is not an upper limit), most of which 
satisfy the BzK criterion. Despite the limited range in 
redshift and SFR probed, a similar trend emerges as seen 
for the SED modeling (Section |4.1|) . While at intermedi- 
ate SFRjjv+iR, the two SFR indicators agree well, the 
dust correction method fails to recover the total amount 
of star formation at the largest SFRjjv+ir- We note 
that, when binning in SFRjjv.corr (i-e., the SFR indi- 
cator to be tested) instead of SFRuv+ir (our reference 
SFR indicator), the median SFRuv+iR of the highest 
S F Rifv,corr bin actually lies below the one-to-one rela- 
tion (large open circles). The fact that the median binned 
relation depends on the axis that is binned reflects a sig- 
nificant scatter in the relation between the UV-I-IR and 
dust-corrected SFR. The absence of a tight relation may 
stem from variations in the dust distribution from galaxy 
to galaxy. The color-coding in Figure [5] furthermore indi- 
cates that the scatter is not random, but that instead the 
relation between the two SFR indicators is a function of 
i^-band magnitude. This illustrates that the offset with 
respect to SFRuv+ir depends on the properties of the 
galaxy sample. E.g., stacking if -bright {K < 22) galax- 
ies in bins of SFRuv,corr Nordon et al. (2010) found an 
excess of 0.3 dex for dust-corrected UV SFRs. 

We conclude that we do not find any evidence that a 
single color dust correction method would be more pre- 
cise than SFRs from modeling of the full U-to-8 fim SED. 
Both dust correction methods perform equally well in the 
redshift range 1.4 < z < 2.5 where they overlap, with 
similar biases for the dusty, most rapidly star-forming 
systems. Resolved SED modeling on the scale of the 
patchiness of the dust distribution may help to overcome 
the saturation of reddening as an extinction tracer when 
IR data are not available (see Section 16.41 and in more 
detail Wuyts et al. in prep). 

4.3. Dependence on Stellar Population Synthesis Models 

Even under identical assumptions on SFH and dust 
reddening, differences between stellar population synthe- 
sis codes give rise to systematic uncertainties in the de- 
termination of stellar population properties. The im- 
pact of the treatment of certain stellar evolutionary 
phases, such as the thermally pulsating AGE phase, on 
the estimated stellar mass and age of (high-z) galax- 
ies has been discussed extensively elsewhere (see, e.g., 
Maraston 2005, hereafter M05; Maraston et al. 2006; 
Wuyts et al. 2007). In Figure [6l we focus on the im- 
plications for SFR recipes. The top two panels com- 
pare SFRjjv+iR to SFRs ED as derived with M05 and 
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Fig. 6. — Idem as Figure |3] but using different stellar popu- 
lation synthesis codes (M05 and CB07 instead of BC03) to de- 
rive SFRsED- In thfi top panels, identical constraints on the SFH 
(specifically the minimum allowed e- folding time Tmin) as for Fig- 
ure |3] were used. This leads to SED-modeled SFRs that are more 
discrepant from SFR(7y+/fl than SFRsed,bC03- By tuning Tmin 
(bottom panels), a correspondence to SFRfyv+lJi of similar quality 
can be obtained. In all cases, the saturation ol^ reddening as extinc- 
tion tracer in dusty, star-bursting (high-redshift) galaxies leads to 
systematic underestimates of SFR by the SED modeling method. 



Chariot & Bruzual (2007, hereafter CB07) models. We 
adopted the same restriction of a minimum e-folding time 
of log T„ii„ = 8.5 that provided the best agreement for 
BC03 models. Similar to the BC03 comparison in Figure 
[21 the SED modeled SFRs lead to underestimates at the 
high SFRuv+iR end. However, at intermediate and low 
SFRs the behavior looks markedly different, especially 
for the CB07 models that lead to consistently lower es- 
timates of the SFR by ^ 0.4 dex. Adjusting a single pa- 
rameter Train in our SED modeling recipe, it is possible to 
calibrate both SFRsed, mq5 and SFRsed, csot such 
that they produce an equally good match to SFRuv+ir. 
as SFRsed, sco3- In other words, our cross-calibration 
of SFR indicators cannot discriminate between different 
stellar population synthesis models. We find, however, 
that BC03, M05 and CB07 models require different con- 
straints on the minimum e-folding time {logTmin = 8.5, 
7.5, and 9.5 respectively) in order to optimally match the 
UV-I-IR based indicator. 

Figure [6] demonstrates that the SED fitting results 
based on M05 and CB07 models show qualitative dif- 
ferences despite the fact that both contain an updated 
implementation of the TP-AGB phase, leading to in- 
creased rest-frame K-h&nd fiuxes and lower stellar mass 
estimates. The two stellar population synthesis models 
differ in at least two aspects. First, the boost in rest- 
frame if-band light due to TP-AGB stars occurs at ear- 
lier times for CB07 (around log age ^ 8.5) compared to 
M05 (around log age ^ 9). Second, the optical (rest- 
frame U — V) colors by CB07 are identical to those of 
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BC03, whereas they are redder by nearly 0.2 mag in the 
M05 models for most of the ages considered. This stems 
from the use of different stellar tracks that do (in the 
case of CB07) or do not (for M05) include convective 
overshooting (see Maraston et al. 2006; Fagotto et al. 
1994 for the Padova stellar tracks used by CB07; Cas- 
sisi et al. 1997 for the Frascati stellar tracks used by 
M05). Together, these differences lead to best-fit solu- 
tions that in the case of M05 correspond to younger, 
dustier and more actively star-forming systems than in- 
ferred from BC03 fits, while CB07 fits yield lower SFRs 
and Ay, and older ages. As we will discuss in Section 
O the stellar ages derived from BC03 models without 
stringent constraints on formation redshift or e-folding 
time tend to be implausibly young. This problem may 
therefore be even more pressing when considering M05 
fits. Finally, we note that for both M05 and CB07 fits, 
solutions with ages for which the TP-AGB contribution 
is maximal, seem to be avoided. The apparent bimodal- 
ity in the upper left panel of Figure |6] is a reflection of 
this bimodality in estimated ages. Along similar lines, 
Kriek et al. (2010) recently reported that, for their sam- 
ple of post-starburst galaxies at 0.7 < z < 2.0, a low 
contribution from TP-AGB stars was demanded in order 
to reproduce their spectral energy distributions. 

To conclude, we flnd that, in common for all stel- 
lar population models considered, underestimates of the 
SFR for the dusty, most actively star-forming galaxies, 
that are abundantly present at high redshift, are intrin- 
sic to any dust correction method assuming a uniform 
foreground screen. SED modeling recipes can be tuned 
to reproduce the level of star formation in galaxies with 
more modest SFRuv+ib- 

4.4. Dependence on Extinction Curve 

So far, we always assumed the Calzetti et al. (2000) 
law for the wavelength dependence of the attenuation. 
Although this law is widely used in the high-redshift lit- 
erature, reddening laws are known to vary, at least in the 
local universe, due to variations in dust composition and 
dust size distribution. Dust grains can grow to larger 
sizes as the density of the gas to which they are mixed 
increases, leading to a larger ratio of visual extinction to 
reddening Ry = Ay / E{B - V) (Maiolino & Natta 2002 
and references therein) . At larger lookback times as well, 
such variations in environment may cause the extinction 
curve to vary between galaxies of different type. Along 
these lines, Reddy et al. (2010) recently argued that 
young z ^ 2 galaxies with ages < 100 Myr may follow 
an extinction curve that is different from Calzetti, and 
instead more SMC-like. 

Here, we briefly consider how our results would change 
when adopting the SMC reddening law from Prevot et 
al. (1984) and Bouchet et al. (1985), which increases 
more steeply towards short wavelengths compared to the 
relatively gray Calzetti law. Naturally, when adopting 
the SMC law, less dust (i.e., a smaller Ay) is needed 
to reproduce the observed UV spectral slope. Further- 
more, it is interesting to note that without imposing 
an ad hoc constraint on the minimum t, the best-fit 
r and age distribution are less skewed towards short 
r and young ages. For the same weak constraints on 
r (log > 7), we find that the underestimate of 

SFRs ED with respect to SFRuy+m is reduced to 



- 0.2 dex at low SFRuy+m < 20 Mq/ut when adopt- 
ing the SMC law. At high SFRs on the other hand, 
the discrepancy becomes worse. In terms of x^i half 
of the low SFRuy+iR < 20 Mo/yr systems are bet- 
ter fit with a SMC law than a Calzetti law, while above 
SFRuy+iR > 20 Mg/yr this fraction drops to 20%. 

From the comparison to SFRuy^m and the relative 
distribution of values, we thus conclude that our re- 
sults are at least qualitatively consistent with a scenario 
where steeper extinction curves are appropriate for lower 
SFR systems. Physically, this may stem from a dust 
grain distribution that is biased to small sizes in these 
environments that are typically of lower gas density. For 
simplicity, consistency with previous work, and because 
a good SFR calibration can be obtained provided an ap- 
propriate Tmin (see Section l4.ip . we will proceed in this 
paper to only use the Calzetti law. More direct probes 
of the extinction curve at high redshift, and its variation 
with galaxy luminosity, age, and/or SFR are desired. 

5. SFRS FROM Ha 

We established self-consistent recipes for SFRs based 
on UV -I- IR and SED modeling. Now, we extend the 
cross-calibration of SFR indicators to measurements of 
the Ha line luminosity, as obtained by SINFONI on 
VLT as part of the SINS survey (Forster Schreiber et 
al. 2009). The benefit of using integral field spec- 
troscopy is the absence of slit losses, which complicate 
the absolute calibration of total Ha luminosities from 
slit-based measurements (e.g., Erb et al. 2006; Reddy et 
al. 2010). This sample comprises 25 star-forming galax- 
ies in the GOODS-South field spread over the redshift 
range 1.5 < z < 2.6, out which 19 are significantly (> Scr) 
detected in Ha. SINFONI exposure times ranged from 
1 to 8.5 hours. Tests on partial data sets of the longest 
exposed targets indicate that possible sensitivity-driven 
fiux losses for the shortest exposed objects are 30%. 19 
out of 25 sources are significantly detected in the MIPS 
24 ixm band, and for 9 of those we extracted PACS fiuxes 
with S/N > 3. We derived monochromatic L/^ using 
the longest wavelength IR band that has a significant 
detection. In addition to SFRuy+jn values for the IR- 
detected sources, we also compute SED modeled SFRs 
for all 25 galaxies in this sample. The two continuum- 
based estimators are contrasted to the line-based mea- 
surement SFRho in Figure [T] 

The Ha fiuxes were converted to star formation rates 
SFRho following the prescription of Kennicutt (1998), 
which assumes case B recombination, corrected to a 
Chabrier (2003) IMF. Not unexpectedly, the rates are 
severely underestimated when no dust correction is ap- 
plied {left panel). Using the Calzetti et al. (2000) red- 
dening law, the visual extinction Ay translates to the ex- 
tinction at the rest-wavelength of Ha as Ana — 0.82^^. 
When adopting the visual extinction as inferred from 
broad-band SED modeling Ay^ sed {middle panel), we 
find the offsets in SFR to be reduced, but still amounting 
to underestimates of 0.55 dex. 

Indeed, it was found by Forster Schreiber et al. (2009) 
that the SINS data are consistent with roughly twice 
higher attenuation towards the HII regions relative to 
the bulk of the stars. Pioneering measurements of the 
Balmer decrement Ha/Hl3 in high-z galaxies seem to 
confirm the increased extinction towards HII regions, al- 
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Fig. 7. — SFRs of SINS galaxies at 1.5 < z < 2.6 as derived from continuum measurements (UV+IR and SED modeling methods) 
compared to the star formation rate as inferred from the Hct line luminosity under various assumptions about its extinction. In the absence 
of a dust correction, or when simply applying the dust correction as inferred from broad-band SED modeling, the //ct-based SFR is clearly 
underestimated with respect to other indicators. Adopting the locally calibrated extra attenuation towards HII regions compared to the 
continuum yields a better correspondence to the other SFR indicators. 



beit with a significant galaxy-to-galaxy scatter in the 
precise degree of differential extinction (Yoshikawa et al. 
2010; Muzzin et al. 2010; Buschkamp et al. in prep). 

In nearby galaxies, it has long been established that 
HII regions are often associated with dustier regions 
than the bulk of the stellar populations across galaxies 
(Calzetti et al. 1994, 2000; Cid-Fernandes et al. 2005). 
Studying a sample of low-redshift star-forming and star- 
burst galaxies, Calzetti et al. (2000) formulated the em- 
pirical calibration Ay, neb = ^v,S£;r>/0.44 where Ay^neb 
and Ay^sED refer to the visual extinction towards the 
nebular regions and the stellar continuum respectively. 

Here, the offset in the middle panel of Figure [7] re- 
flects this need to take into account differential extinc- 
tion. Applying the local calibration to our high-redshift 
sample {right panel of Figure [7]), we find the Ha-based 
SFR to line up well with estimates based on UV-I-IR 
and SED modeling. We measure a median A log SFR = 
log SFRcontinuum " logSFRua of 0.08, with a 0.33 dex 
scatter. We verified that our analysis leads to the same 
conclusion when deriving Av^sed using M05 models. 
Furthermore, the 3 objects in our sample that are X- 
ray detected, among which object ID6202 whose broad 
Ha line profile suggests the presence of AGN activity 
and/or shocks, do not deviate from the remaining galax- 
ies in our sample, implying that contributions to the Ha 
luminosity by non-stellar processes do not dominate the 
observed trend. 

Using SFR indicators from UV to IR wavelengths, we 
verified the findings on differential extinction by Forster 
Schreiber et al. (2009), which are also consistent with 
recent work by Onodera et al. (2010) and Mancini et al. 
(2011). In contrast, Reddy et al. (2010) found that such 
an extra attenuation correction towards HII regions may 
not be necessary. This may be due to observational ef- 
fects (e.g., uncertain slit loss corrections), but differences 
between the intrinsic properties of the galaxy samples 
could play a role as well. The galaxies studied by Reddy 
et al. (2010) typically have lower stellar masses. Their 
extinction may differ due to different dust properties or 
distributions. 



6. LIFE PATHS THROUGH THE SFR VERSUS MASS 
DIAGRAM: A CONSISTENCY CHECK 

With the above calibrated 'ladder of SFR indicators' 
and SED modeling recipes in hand, we now perform a 
galaxy population study, linking progenitors and descen- 
dants in the SFR versus mass diagram over a range of 
redshifts, and testing for consistency. We detail how ob- 
served populations at high redshift are compared to the 
expected populations inferred from observations at lower 
redshift in Section [OI In Section |6^ we carry out this 
methodology multiple times, varying only the constraints 
on the galaxies' SFH. Here, we make the case for under- 
estimated galaxy ages as a significant contributor to the 
observed discrepancies. After addressing merging as an- 
other potential contributor (Section 16. 3|) . we return to 
the issue of constraining galaxy ages in Section 16. 4[ ar- 
guing for an extended wavelength baseline, or spatially 
resolved SED modeling as avenues for improvement. 

6.1. Methodology 

If we ignore merging, a galaxy with a mass Mobs in 
stars at the epoch of observation can be considered as 
one unit since its onset of star formation at time i/orm, 
and built up its stellar content following 

rtobs 

Mobs — I SFR{t — tform)flossitobs — t) <^t. (2) 

Here, the fraction fioss{t) accounts for stellar mass loss. 
This fraction of mass remaining in stars first declines 
rapidly once the first supernovae go off, then decreases 
more gradually by winds from AGB stars, reaching ~ 
0.6 after 1 Gyr, and eventually levels off at ^ 0.5 for a 
Chabrier IMF (BC03). 

As age will be an important parameter in our analysis, 
it is useful to point out that two definitions of galaxy age 
are frequently used in the literature; one being the time 
since the onset of star formation: 

— tobs Inform: (3) 
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and the other being a measure of the age of the bulk of 
the stars: 

ageuj ^ / SFR{t - tform){tobs - t) dt / 

[ SFRit-tform)dt. (4) 

For exponentially declining SFHs, age^ ranges between 
0.5 X age (r — oo) and 1 x age {t — 0). Throughout this 
paper, when referring to age^, we will explicitly state 
'SFR- weighted age'. 

Given equation [21 a particular functional form (in our 
case parametrized by t) for the star formation history 
SFR{t) translates to a unique path SFR{M) of the 
galaxy in the SFR versus mass diagram. Consequently, 
the constraints on Afobs, r, and SFRo^s allow us to de- 
termine the location of each galaxy's progenitor in the 
SFR-M diagram at any point earlier in time. For galax- 
ies with a PACS and/or MIPS detection, we apply the 
IR constraints to the SED modeling by searching for 
the minimum solution within a 0.1 dex range around 
SFRjjyj^in. This approach reduces the degrees of free- 
dom by one. In addition, it avoids the underestimates 
by SFRsED at the high-SFR end (see Section [H]) . For 
galaxies that lack a far- or mid-IR detection, we simply 
use the SED modeling performed without constraints on 
the SFR. 

We then compare the galaxy population observed at 
1 < z < 2 to that expected at that epoch from trac- 
ing the < z < 1 population back in time. Likewise, we 
confront the observed 2 < 2 < 3 population to that back- 
traced from 1 < 2; < 2, and the 3 < z < 4 population 
to that inferred from galaxies at 2 < z < 3. Under our 
premise that galaxies satisfy a continuity equation over 
cosmic time, both the location of the main sequence of 
star formation, and how densely it is populated should 
match. The presence of any discrepancies could hint at 
ill-characterized SFHs, breaking of number conservation 
by merging, or cosmic variance leading to one redshift 
interval being under- or overdense with respect to the 
other. 

6.2. Tracing Galaxies back in Time 

6.2.1. T Models 

We first perform the above described galaxy popula- 
tion analysis adopting exponentially declining SFHs with 
a large range of allowed e-folding timescales (r > 30 
Myr) . In Figure |8^, we contrast the observed SFR-M re- 
lation {red) to that inferred from the SED modeled SFHs 
of galaxies in the previous redshift bin ( black) . The black 
and red symbols were plotted in a random order, in order 
to preserve information on the relative abundance of the 
two populations, even in crowded regions of the diagram. 
At all redshifts, the backtraced model matches the ob- 
served main sequence of star formation reasonably well 
in terms of zeropoint, slope, and scatter. In particular, 
even though r models by definition do not allow for in- 
creasing SFRs, the upwards shift of the main sequence 
from z = to z = 3 is captured by the model. As is 
empirically well established, galaxies with lower specific 
star formation rates exist alongside normal star-forming 
galaxies at least out to z ^ 3 (see, e.g., Labbe et al. 



2005; Kriek et al. 2006). Their relative abundance is 
an increasing function of galaxy mass, and declines with 
increasing redshift (Fontana et al. 2009). We note that, 
while these quiescent systems are abundantly present at 
1 < z < 2, they are missing from the backtraced pop- 
ulation computed for that redshift bin. Galaxies with 
similarly low specific star formation are present in our 
< z < 1 sample. This suggests that we are not merely 
missing the descendants of 1 < z < 2 quiescent galaxies 
due to the smaller volume probed at < z < 1. In gen- 
eral, it is the case that the backtraced number densities, 
both on and off the main sequence, are at all redshifts 
lower than those directly observed. 

This result is quantitatively better appreciated when 
we collapse the SFR-M diagram along the SFR axis, re- 
sulting in the mass functions plotted in Figure [QK- Here, 
the solid lines represent the observed mass function at 
different redshifts. In agreement with Marchesini et al. 
(2009), whose analysis was partly based on the same data 
set, we observe fairly little evolution in number density at 
the high-mass (> 10^^ Mq) end, compared to a steeper 
build-up of stellar mass with cosmic time in the inter- 
mediate (~ 10^° -^0) regime. We did not correct the 
observed mass functions for incompleteness. Instead, we 
apply the K — 24.3 observed magnitude limit of the 
FIREWORKS catalog to the galaxies that were evolved 
back in time, leading to the dashed curves in Figure [9K. 
This approach allows for a fair comparison as long as 
galaxies' higher-z progenitors have fainter observed K- 
band magnitudes than their descendants at the epoch of 
observation. 

Figure [9^ clearly shows that the backtraced popula- 
tion, as based on SED modeling with r models with a 
large freedom in t, is underpopulated. The underesti- 
mate of the number densities increases with decreasing 
stellar mass, becoming dramatic at intermediate masses, 
particularly for the lower redshift bins, where it exceeds 
an order of magnitude. As we will see in Section r6.31 such 
large differences in number density are unlikely to result 
from merging of galaxies alone, a process that we ignored 
so far in our analysis. An underdensity of the GOODS- 
South field in the < z < 1 interval (Wolf et al. 2003) 
may contribute to the offset seen between the observed 
and backtraced 1 < z < 2 population. However, the 
same trend is seen at all redshifts. This cannot be due 
solely to field-to-field variations unless each redshift bin 
is underdcnsc relative to the one above. Moreover, the 
magnitude of the offsets seems incompatible with field-to- 
field variations. Following the methodology of Somerville 
et al. (2004), the expected uncertainty due to cosmic 
variance on the number density of logM > 10.5 galaxies 
at0<z<l, l<z<2, 2<z<3, and 3 < z < 4 
as derived from the GOODS-South field is 34, 24, 26 
and 32% respectively, much smaller than the discrepan- 
cies between the observed and backtraced populations in 
Figure |9^. The most straightforward interpretation is 
therefore that the SED modeled stellar populations are 
biased towards young stellar ages. Given an underesti- 
mated age, we would infer the stellar mass of a galaxy 
to be built up relatively recently. Or in other words, as 
we trace this galaxy back in time, its stellar mass would 
decrease too rapidly, leading to a lower formation red- 
shift at which the galaxy drops out of the backtraced 
sample. The origin of such age underestimates lies most 



11 



& 



ct: 
u. 



o 



o 



t/) 

^ — ^ 

o 



o 



1 1 1 ' 1 

(0) 1 < z < 2 


2 < z < 3 

* 

of B 
1 


1 \" 1 1 1 1 1 1 1 1 1 1 1 

3 < z < 4. 

• 

• 

....•....^t V, 


1 '1 1 1 

(b) 1 < z < 2 
" ■* . • 1 

* ' *i ■* ^ 


2 < z < 3 
... '"^^^^^^ 


1 1 " 1 1 1 1 1 1 1 1 1 1 1 

3 < 2 < 4 , 

• ft/t .» - 

* 


1 1 1 1 

(c) 1 < z < 2 

• . * ° = i 

[■,15, ■^(^ 

; ' ci " ^ ; 

■ » * « 


2 < z < 3 

° ' I 


3 < z < 4" 

^ ': 


!^ ■ ■ ■ ■ " 'f' ■ wr,y^'iw<'w*.>'*Y,^^'>»TP 

(d) 1 < z < 2 

I I ' I I 


2 < z < 3 \ 

1 1 1 — I 


3 < z < 4 i 



9 10 11 12 8 
log(M) [Me] 



9 10 11 12 8 
log(M) [Me] 



9 10 11 12 
log(M) [Me] 



Fig. 8. — Observed (red) SFR versus stellar mass diagram in three redshift bins, compared to the distribution and population expected 
for the same volume from tracing galaxies in the previous redshift bin back in time {black). The following constraints were applied in 
modeling the SEDs with BC03 templates: (a) t models with log(rmi„) = 7.5, (b) r models with log(Tmi„) = 8.5, (c) delayed t models 
with log{Tmi„) = 8.5, (d) delayed t models with log(r^i„) = 8.5 and all galaxies being maximally old. While occupying a similar region 
in the SFR versus mass diagram as the observed galaxies, the backtraced population is undernumerous at all redshifts in cases (a) to 
(c), suggesting an overall underestimate of formation redshifts, especially when no stringent constraints on e-folding time and/or age are 
imposed. 
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plausibly in the fact that younger stellar populations out- 
shine any underlying old population in the integrated 
SED, especially at the shorter wavelengths. Because of 
this outshining effect, any deviation from the simple SFH 
and foreground dust distribution assumed makes it im- 
possible to correctly recover the true SFR-weighted age 
(Wuyts et al. 2009a; Maraston et al. 2010). 

Somewhat earlier formation epochs are obtained when 
requiring longer e-folding timescales of t > 300 Myr. 
Since this also provided a better agreement between SFR 
indicators (Section 14. 1|) . we adopt this value of Tmin in 
the remainder of our analysis. In the median, the SFR- 
weighted age of the galaxies in our sample then increases 
by 0.15 dex. The high-mass end of the observed and 
backtraced mass functions now show a good agreement 
(FigureOj). At lower masses, the offsets remain present, 
although reduced by a few 0.1 dex. The mass dependence 
of the offsets may imply that age underestimates are 
more of a concern for low- to intermediate-mass galaxies 
than for the most massive (often older) systems. 

6.2.2. Delayed t Models 

Recently, several authors have argued, both on obser- 
vational (Renzini et al. 2009; Maraston et al. 2010; 
Papovich et al. 2011) and on theoretical (Finlator et 
al. 2007; Lee et al. 2010) grounds, that galaxies may 
undergo increasing, rather than decreasing, SFHs during 
parts of their life, specifically at early times ^ 3). To 
allow for such phases of increasing SFR without imposing 
an artificial breakdown of our galaxy sample in sources 
fitted with strictly increasing versus sources fitted with 
strictly decreasing SFHs, we repeated our analysis using 
the following SFH: 

SFR{t) ^ te-*/^ (5) 

These so-called delayed t models allow for solutions 
with increasing SFRs (for ages < r) as well as solutions 
in which the SFR is decreasing after a prior phase dur- 
ing which it was increasing (for ages > t). At i <C t, 
SFR{M) ^ \fM for delayed r models as opposed to 
SFR{M) « constant for simple r models. This corre- 
sponds to evolutionary tracks with steeper initial slopes 
in the SFR-M diagram than default r models, but still 
shallower than many literature values for the slope of the 
main sequence. The latter vary between this slope of a 
half and a slope of unity. Its precise value is still widely 
debated (Noeske et al. 2007; Cowie & Barger 2008; Pan- 
nella et al. 2009; Santini et al. 2009; Rodighiero et al. 
2010), and may evolve with redshift (Dunne et al. 2009). 

Figure [TUk shows the age/r ratio obtained by fitting 
delayed t models to the SEDs of < z < 4 galaxies as 
function of SFR. Overall, only 20 % of the galaxies is 
best characterized by being observed during a phase in 
which the SFR is increasing {age < r). However, this 
fraction is a strong function of the level of star-forming 
activity (and therefore redshift). At SFRs above 10, 50, 
and 100 AlQ/yr, galaxies that are best fit by rising SFRs 
account for 40, 60, and 70% of the overall population. 
At > 2, 37% of the galaxies in our K < 24.3 sam- 
ple are found to be in the rising part of their SFH (i.e., 
age < t). We note that the reduced chi-square (xled) 
values of the fits with delayed t models are indistinguish- 
able from those obtained by fitting simple t models. The 



SFR-weighted ages are in the median 0.1 dex older than 
those inferred from simple r models, with a scatter of 
0.15 dex, and a weak trend with age (the systematic off- 
set being smaller for systems that formed the bulk of 
their stars more than a Gyr ago). Despite the modest 
increase in estimated ages, it is clear from Figure [8t and 
that the underestimate of galaxy number densities at 
intermediate masses remains significant for this alterna- 
tive SFH. In other words, in the absence of any stringent 
constraints on age, the delayed r models are subject to 
the same outshining effect as were the simple t models. 

6.2.3. Maximally Early Onset of Star Formation 

Finally, we repeat the population analysis once more, 
now making the assumption that all galaxies are maxi- 
mally old (i.e., age = tHubbie(zobs))- The SFR-weighted 
age agcw may (and will) still be younger than the age of 
the universe at the epoch of observation, but we perform 
the SED modeling forcing the onset of star formation to 
take place at very high redshift. Given the discreteness 
in age of stellar population models, small variations in 
the adopted z/orm occur depending on the galaxy's pre- 
cise redshift, but it is typically around Zform ^ 

10 and 

the precise value is irrelevant for this exercise. We again 
adopt delayed r models, as they result in Xred values 
that are better by a factor of 1.4 than those obtained 
from fitting simple r models with the same restriction 
on Zform- In practice, the least squares solution is now 
often one with r > age (see the abundance of galaxies 
with \og{age/T) < in Figure [TOb). such that the SFR- 
weighted age is minimized within the allowed range that 
goes from 0.5 (for r « oo) to 1 (for r « 0) times the 
age of the universe at the observed epoch. These long 
e-folding times, obtained when maximizing the forma- 
tion redshift, serve not to violate the constraints from 
the longer wavelengths on stellar mass, which tends to 
be the most robust parameter in SED modeling (e.g., 
Forster Schreiber et al. 2004; Shapley et al. 2005). Nev- 
ertheless, we do note that the median stellar mass of 
galaxies in our sample increases by 0.25 dex compared 
to SED modeling where no maximum formation redshift 
was imposed. 

Although extreme, the assumption of maximally old 
galaxies is useful as a sanity check. If the backtraced 
galaxy samples are still underpopulated with respect to 
the observed samples, the origin of the discrepancy must 
most likely be attributed to other effects such as merging, 
cosmic variance, or more extreme SFHs in which an even 
larger fraction of the stellar mass was assembled at very 
early times. Figure [9ji shows that this is not the case. 
At all redshifts, the backtraced mass functions match 
the observed mass functions surprisingly well, and are if 
anything slightly overproducing the number densities in 
some mass bins (an effect that is somewhat more pro- 
nounced if we had adopted simple r models, that lead to 
even older SFR-weighted ages). We note that the SFR-M 
relation (Figure [8]i) has tightened significantly by mak- 
ing the rigorous assumption on early formation epochs. 
This indicates that, aside from an intrinsic scatter ow- 
ing to galaxy-to-galaxy differences in SFH, the scatter in 
the observed SFR-M relation also depends on model as- 
sumptions. Likewise, we find that conclusions on the 
differential evolution of the stellar mass function also 
depend on the assumed SFH. When making similar as- 
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Fig. 9. — Mass functions from z = to 4 as observed (solid curves), and as obtained by tracing lower-redshift galaxies back in time using 
the constraints on their SFHs from SED modehng {dotted curves). No incompleteness corrections are applied. Instead, dashed lines show 
the mass function of the traced-back populations after applying the K = 24.3 magnitude limit. The following constraints were applied in 
modeling the SEDs with BC03 templates: (a) t models with log(rmin) = 7.5, (b) t models with log(rmin) = 8.5, (c) delayed t models 
with log(T,„i,j) = 8.5, (d) delayed r models with log(Tmi„) = 8.5 and all galaxies being maximally old. Chi-square minimization prefers 
to fit steeply declining SFHs with young ages. Such solutions are physically unrealistic, as they violate the galaxy continuity equation as 
tested here. Instead, star formation over more prolonged timescales seems preferred. 



sumptions about the SFH as Marchesini et al. (2009), 
we confirm their finding of fittle evolution at the high- 
mass end, and much more at lower masses (see Figure 
inb)- When instead using delayed tau models in combi- 
nation with tight constraints on the formation redshifts, 
the degree of such differential evolution reduces signifi- 
cantly (see Figure [Hi). 

6.3. The Effect of Merging 

So far, we assumed that each observed galaxy had only 
one progenitor that assembled its stellar mass by in situ 
star formation. In a hierarchical universe, where dark 
matter haloes merge to form increasingly larger struc- 
tures, carrying with them the baryons that assembled 
at the centers of their potential wells, our simplified toy 
model must therefore fail to capture the detailed evolu- 
tion of galaxy populations at some level. In order to 



properly account for galaxies merging (or in fact de- 
merging as we trace them back in time), an accurate 
knowledge of the merger rate as function of redshift, 
mass, and mass ratio would be required. Moreover, the 
probability that a galaxy observed at redshift Zobs under- 
went a merger since redshift Zofcs + Az (and hence has to 
be split in two as we trace it back to that epoch) may 
not only depend on its mass. For example, if merging 
leads to quenching (e.g., Di Matteo et al. 2005), a qui- 
escent galaxy may be more likely to have undergone a 
merger in its past than a star-forming galaxy of similar 
mass. Furthermore, it is well motivated that mergers 
between gas-rich systems trigger starbursts in their cen- 
ters, which inhibits evolving galaxies back along smooth 
tracks in the SFR-M diagram. In fact, the scatter in 
the SFR-M relation of star-forming galaxies may well be 
produced by such burstiness, triggered by (minor) merg- 
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Fig. 10. — Ratio of age (i.e., time since onset of star formation) 
over T as function SFR for delayed r models with (a) weak con- 
straints on age (age > 50 Myr), and (b) maximally old ages. When 
leaving age free, only a fraction of galaxies at the high SFR end is 
best modeled as following a rising SFH at the epoch of observation. 
When imposing very high formation redshifts (2/orm ~ 10) 1 larger 
values of T (> age) are preferred. 



ers or other processes. Clearly, implementing all of these 
processes in our consistency check is beyond the scope 
of this paper. We therefore choose to illustrate the ef- 
fect of merging activity on our analysis by two models of 
increasing complexity. 

6.3.1. A Simple Toy Model for Merging 

The first is an idealized toy model, presented in Fig- 
ure [TT] Here, the black curve represents a hypothetical 
mass function at Zobs + as traced back from galaxies 
observed at Zobs- The dotted line accounts for the mag- 
nitude limit of our catalog. If every galaxy observed at 
Zobs underwent one merger in the interval Az (for sim- 
plicity we assume one that did not affect the SFH), the 
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Fig. 11. — Toy model exploring the impact of merging on tests 
of the galaxy continuity equation. Let the black curve be the high- 
redshift mass function as constructed by evolving a lower redshift 
observed galaxy population back in time, in the absence of merging. 
In a scenario where each galaxy underwent one merger between 
the low and high redshift bin, with mass ratio chosen randomly 
between 10:1 and 2:1, the green curve would be the resulting high- 
redshift mass function. Two merger events (red) or more per galaxy 
would steepen the mass function further. 



actual number of progenitors will be twice as large as an- 
ticipated in the absence of merging. Since the mass dis- 
tribution of progenitors will be shifted to lower values, 
the number of galaxies entering our magnitude-limited 
catalog will be larger by a somewhat smaller factor. Un- 
der the assumption of a merger rate that is a constant 
function of the mass ratio over the range 10:1 to 2:1, the 
resulting mass function is plotted in green. If all of these 
merger progenitors themselves descended from mergers 
taking place since Zobs + Az, the red curve would be the 
backtraced population (consisting of four times as many 
galaxies as observed at Zobs, ignoring incompleteness is- 
sues). The net effect is a steepening of the mass function, 
the magnitude of which depends on the amplitude of the 
merger rate. 

6.3.2. Merging in Semi- Analytic Models 

While conceptually demonstrating the impact of merg- 
ing, our toy model does not contain information on the 
frequency of galaxy collisions, and therefore its conclu- 
sions remain qualitative at best. In order to obtain a 
more quantitative estimate, we turn to a second, more 
physically motivated model in Figure [T^l Here, the 
solid lines mark the galaxy stellar mass functions at a 
range of redshifts as extracted from the Munich semi- 
analytic model (SAM) of galaxy formation by De Lucia 
& Blaizot (2007). The SAM is rooted in the pure dark 
matter Millennium Simulation (Springel et al. 2005), and 
hence incorporates the hierarchical mass build-up char- 
acteristic for a A-CDM cosmology. It furthermore con- 
tains a set of prescriptions to model the baryonic physics 
within this framework, and outputs among other param- 
eters the observed-frame photometry, which we use to 
apply the K-hand magnitude limit of the observations. 
Here, we will not focus on differences between the model 
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Fig. 12. — Stellar mass functions down to an observed X-band 
magnitude of 24.3 derived from the De Lucia & Blaizot (2007) 
SAM which is based on the Millennium Simulation {solid lines). 
Dashed lines indicate the mass functions at 2 = 1.5, 2.5 and 3.5, 
constructed after summing those galaxies that have a common de- 
scendant at z = 0.5, 1.5 and 2.5 respectively. Accounting for merg- 
ing as we track galaxies back in time (solid lines) steepens the mass 
function with respect to an approach that ignores merging {dashed 
lines) . 



and observed mass functions (most notably a relatively 
late build-up of the high-mass end and early build-up 
of the low-mass end in the SAM), these are discussed 
at length by Marchesini et al. (2009). Instead, we use 
the SAM with its physically motivated merger rates as a 
self-consistent testbed to constrain the impact of merging 
activity. 

Since the merger tree of each galaxy is stored, it is 
trivial to perform our analysis from Section 16.11 on the 
SAM, i.e., to evolve galaxies back in time according to 
their SFH ignoring merging. For example, to obtain the 
inferred population at z = 2.5 by winding back the clock 
from z = 1.5, we simply group all z = 2.5 galaxies that 
have a common descendant at z — 1.5, sum their masses, 
and construct the mass function from the resulting sam- 
ple {red dashed line in Figure . In the SAM, by con- 
struction, any difference between the backtraced and ac- 
tual mass function at a given redshift is due to merging. 
As expected from our simplistic toy model (Figure [TT|) . 
the net effect of including merging is a steepening of the 
mass function: while the high-mass end is reduced, the 
number density at intermediate masses increases by 0.25 
dex, 0.15 dex and dex at z = 1.5, 2.5 and 3.5 respec- 
tively. 

We conclude that accounting for mergers will reduce 
the discrepancy in number densities seen in Section 16.21 
at low to intermediate masses, but typical merger rates 
from dark matter -I- semi-analytic models seem insuffi- 
cient to fully resolve the underdensities we found. Invok- 
ing more active merger histories would break down the 
good agreement at the high-mass end (M > 10^^ Mq) by 
suppressing the abundance of those galaxies. Given these 
concerns, we now turn to what is likely a major contrib- 
utor to the offsets between the observed and backtraced 
mass functions in Figure |9^-c: biases in characterizing 



the galaxy-averaged stellar age. 

6.4. Constraining Galaxy Ages 

In Section I6.2[ we found that the backtraced and ob- 
served mass functions over a wide range of redshifts 
matched better when forcing maximally old ages in the 
SED modeling. Although merging and stochastic bursts 
will undoubtly affect the evolutionary tracks of galax- 
ies in the SFR-M diagram, it seems unlikely that they 
can account for the discrepancies observed when leaving 
formation redshifts as a free parameter in the SED mod- 
eling. We therefore turn to the measurement of galaxy 
ages, and discuss two avenues for improvement. 

6.4.1. Extending the Wavelength Range 

First, it is illustrative to contrast the best-fit stellar 
population models with and without the constraint of a 
maximally early onset of star formation. The top two 
panels in Figure [13] show the SEDs of two quiescent 
galaxies at z ~ 1.5, while the bottom three panels repre- 
sent cases at the same redshift of galaxies populating the 
main sequence of star formation. Irrespective of galaxy 
type, we note that both the free age and the maximally 
old best-fit model reproduces the broad-band SEDs re- 
markably well. By and large, the two templates are indis- 
tinguishable for broad-band tracers, except at rest-frame 
wavelengths shortward of 0.2 /im, and to a lesser ex- 
tent at the longest wavelengths probed {Xrest ^ 2.5/im). 
Present-day stellar population synthesis codes are fac- 
ing the largest differences in the latter regime (Maraston 
2005). We therefore conclude that more stringent con- 
straints on the Xrest < 0.2/.tm SED, from deep FUV, 
NUV, and [/-band observations, would help most to bet- 
ter constrain galaxy stellar ages without the need to im- 
pose ad hoc constraints on formation redshifts. Rettura 
et al. (2010), among others, recognize this, and succes- 
fully exploit rest- frame UV data of z ^ 1.2 ellipticals in 
clusters and field environments to constrain both their 
onset and timescale of star formation. 

6.4.2. Resolved SED Modeling 

A second avenue for improvement concerns not so 
much the wavelength range, but rather the scales probed 
with multi-wavelength photometry. The outshining ef- 
fect that limits our ability to constrain galaxy ages re- 
sults from an older, underlying stellar population whose 
existence is not captured by the best-fit SFH because the 
SED is dominated by the younger and less obscured part 
of the stellar population. If such multiple components of 
the galaxy's SFH are spatially disjoint, as might be ex- 
pected, we may bypass or at least reduce the outshining 
effect by means of resolved SED modeling. The advent 
of the WFC3 camera onboard HST, in concert with the 
long-time workhorse ACS, opens a window on such anal- 
yses. Wuyts et al. (in prep) present in detail the implica- 
tions of resolved stellar populations at high redshift, but 
for our present purpose we focus solely on the estimate of 
SFR- weighted ages (i.e., the age of the bulk of the stars). 
Here, we use the WFC3 Y, J, and H data of the Early 
Release Science (ERS) program as testbed, in combina- 
tion with ACS B, V, i, z data in GOODS-South matched 
to the same resolution (0.16", sampled by a 0.06" /pix 
drizzled pixelscale). For a sample of 282 galaxies with 
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Fig. 13. — Rest-frame broad-band SEDs of 2 ~ 1.5 galaxies, 
with best-fit BC03 template leaving age free (blue), or forcing a 
maximally old onset of star formation {red). The top two panels 
present quiescent galaxies, whereas the bottom three panels present 
SEDs of galaxies on the main sequence of star formation. Stronger 
observational constraints on the rest-frame 0.1 - 0.2 fiia regime are 
desired to improve the estimate of stellar ages. 



K < 24.3 and 1.5 < z < 2.5, we perform a default SED 
modeling analysis on the summed BVizYJH photome- 
try of all galaxy pixels with S/Nh > 3. We ignore any 
aperture corrections, which may well be important for es- 
timates of, e.g., total stellar mass if a substantial fraction 
of the galaxy mass resides in the outskirts where pixels 
have S/Nh < 3. However, it is irrelevant for our present 
discussion on galaxy age and how its estimate depends on 
an integrated versus resolved approach, because in both 
cases we work with the same set of pixels (namely those 
with S/Nh > 3). Next, we run the SED modeling pro- 
cedure on a pixel-by-pixel basis, yielding a best-fit mass, 
age, r, and Ay for each object pixel individually. Our 
approach is similar in spirit to the recent study by Zi- 
betti. Chariot & Rix 2009 who spatially resolve the stel- 
lar populations in nearby galaxies (obviously to scales 
much smaller than probed in our high-redshift sample). 
We confirm that the sum of all best-fit templates from 
the pixel-by-pixel analysis provides an excellent match 
to the integrated galaxy SED. In Figure [Ml we compare 
the SFR-weighted age obtained from pixel-by-pixel SED 
modeling to that preferred by fitting one r model to the 
integrated galaxy photometry. Here, the SFR-wcightcd 
age inferred from resolved SED modeling is computed fol- 
lowing equation El but the effective SFH over which one 
integrates is now not a simple r model, but the super- 
position of r models, each with their own normalization, 
T and age, for each of the object pixels. We observe a 
clear trend of increased inferred stellar ages, ranging from 
to 2 orders of magnitude, when adopting a resolved 
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Fig. 14. — Comparison of the SFR-weighted age of galaxies with 
K < 24.3 at 1.5 < z < 2.5, as determined by fitting simple r models 
to the ACS/WFC3 BVizYJH photometry of each object pixel 
individually, versus fitting a simple r model to the integrated SED. 
Spatially resolved SED modeling results in higher age estimates 
since variations in SFIf on galaxy-wide scales can be accounted 
for, hence reducing the outshining effect by young stars. 



SED modeling approach. The offsets are small for galax- 
ies where integrated SED modeling indicates old stellar 
populations and low SFRs, specific SFRs, and visual ex- 
tinctions. For systems where younger ages, and higher 
SFRs, SFR/M, and Ay are inferred from integrated SED 
modeling, the age correction factors exhibit a large scat- 
ter. Formally, the offsets correlate most strongly with 
galaxy age. We derive a mean correction 



log age. 



w .resolve: 



ived = 6.86 - 1.75 log age^, integrated (6) 



with a scatter increasing from 0.1 dex for the systems 
with the oldest age^^ ^integrated to 0.4 dex for those with 
the youngest age^j, integrated- 

This behavior may partially be driven by the condition 
that the time since the onset of star formation must be 
less than the age of the universe at the epoch of observa- 
tion (galaxies with old agcw, integrated already lie close to 
this upper bound). We do find only a very weak corre- 
lation with angular size (i.e., the number of independent 
resolution elements in which the galaxies are resolved) , a 
property that itself correlates, at a given mass, with star 
formation activity (Franx et al. 2008; Toft et al. 2009). 

In order to rule out any systematic biases by fitting 
to the low S/N SEDs of individual pixels, we applied 
the same integrated and resolved SED modeling proce- 
dure to a set of 500 mock galaxies with a uniform stellar 
population and representative stellar masses, sizes, and 
i?-band pixel flux distributions. Our toy galaxies vary 
in e-folding time between 30 Myr and 3 Gyr, in the time 
since the onset of star formation between 100 Myr and 
the age of the universe, and were attenuated by a uniform 
visual extinction of Ay = 1. We computed their broad- 
band SEDs as observed at z — 2, and disturbed the pixel 
fluxes appropriately according to the pixel-to-pixel rms 
of the real data. We confirm that both the integrated 
and resolved SED modeling methods correctly recover 
the input SFR-weighted ages, to within a few percent. 
The central 68th percentile interval of the Alogage^ 
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distribution broadens by 35% for resolved compared to 
integrated SED modeling, but no systematic bias is in- 
troduced by the method. Based on this sanity check, 
we can state with confidence that the systematic offsets 
observed for the ERS2 galaxies are due to real spatial 
variations in their stellar populations. 

Given the limited resolution, we can only account for 
differences in SFH across the largest galaxy- wide scales. 
The outshining may well remain present on scales below 
the resolution of the WFC3 imaging (0.16" FWHM, cor- 
responding to 1.3 kpc at z — 2, sampled by drizzled pixels 
of 0.06"). We note that the effective visual attenuation 

Av,resolved, COmputcd aS 



Type of SFR indicotor 





SED 


UV+24^m 


UV+PACS 



A 



CNpix > 
^ ^ -^V,intrinsic,i 
1^1 / 



/ Npix 

-2.5 log ^v^- 



ttenuated,i I ; 



(7) 



also increases when performing resolved SED modeling, 
by 0.2^0 2 mag. This technique may therefore help to 
alleviate both the outshining effect and the saturation of 
reddening by patchy dust obscuration (see Section l4.ip . 
Obviously, its use is limited to special data sets rich in 
multi-wavelength high-resolution imaging. Finally, de- 
spite the strong observed trends, we caution that SED 
modeling on an individual pixel basis is still subject to 
the same age-dust degeneracy that is well-known from in- 
tegrated SED modeling, particularly when based on the 
relatively narrow observed B-to-H wavelength baseline. 

7. SUMMARY 

We compared SFR indicators out to redshift z — 3 
relying on PACS far-infrared, MIPS 24 ^m, and UV 
emission, J7-to-8 fiia SED modeling, as well as Ha spec- 
troscopy. Figure fTSl illustrates that, in order to study the 
entire galaxy population (above a certain mass) out to 
high redshift, a combination of self-consistent SFR indi- 
cators is required. This applies even more so to studies 
that exploit larger areas that often received only shal- 
lower PACS and/or MIPS imaging. This study aims 
at providing such a series of cross-calibrated recipes. 
In Wuyts et al. (2011b), we exploit this continuity 
across SFR indicators in concert with high-resolution 
HST imaging to address the relation between the level 
and mode of star formation, and its mass dependence. 

Using the deeper PEP data in GOODS-South, we con- 
firm previous reports by Nordon et al. (2010) and El- 
baz et al. (2010) that luminosity-dependent conversions 
from observed 24 /im to Ljfj based on the CEOl and 
DH02 template libraries lead to overestimated Lju (and 
hence SFRs) at the highest SFRs {SFR > 100 M^/yr) 
and redshifts (z > 1.5). This trend is also consistent 
with early findings by Papovich et al. (2007) on the ba- 
sis of stacking of MIPS 70 /im and 160 /im data. Using 
the luminosity-independent conversion by W08, which is 
based on a template with enhanced PAH emission rel- 
ative to local ULIRGs, gives Lm based on 24 iim that 
are in the median consistent with PACS photometry, al- 
though with a 0.25 dex scatter. 

SED modeling recipes can be tuned to reproduce 
SFRuv+iR for systems with low to intermediate SFRs 
(e.g., by adjusting the minimum allowed e-folding time 




log(u) [M»l I09(M) [Mj log(u) [M,] 

Fig. 15. — SFR versus mass diagram in the GOODS-South, 
GOODS-North, and COSMOS field. The color-coding illustrates 
the SFR indicator used, from SED modeling for all galaxies in a 
SFR-mass bin (black) to SFRjjy+pacs foi' galaxies in a SFR- 
mass bin (red). Blue regions contain predominantly galaxies with 
SFRuv+24,tj.m as primary SFR indicator. A combination of SFR 
indicators, as cross-calibrated in this paper, is necessary to fully 
probe the main sequence of star formation, particularly in wider 
fields with shallower PACS and MIPS depths. 



for exponentially declining SFHs). The relatively long 
(at least several 100 Myr) e-folding times required for 
SED modeling with BC03 to match SFRjjv+ir implies 
that most star formation happened in a relatively stable 
mode, varying slowly on timescales that correspond to of 
order ~ 50 dynamical times. 

Galaxies at the high SFR end (5*^^^ > 100 Ma/yr) 
tend to be dusty, and are increasingly common toward 
high redshifts. Simple dust correction methods assum- 
ing a uniform foreground screen fail to recover their total 
amount of star formation. Due to the patchiness of the 
dust distribution in these objects, reddening saturates as 
a tracer of extinction. This bias applies to SED model- 
ing as well as methods based on single colors, that are 
designed for a specific (high) redshift range (Daddi et al. 
2007a). At the low SFR end, we find a hint of steeper, 
more SMC-like extinction curves than the Calzetti et al. 
(2000) law. Such a trend could occur if a lower gas den- 
sity in those systems lead to a size distribution of the 
dust grains mixed with the gas that is biased to small 
grains. 

SFRs based on Ha luminosities of star-forming galaxies 
at z ^ 2 show a good correspondence to SFRuv+ir. 
and SFRs ed , provided we account for extra attenuation 
towards HII regions compared to the stellar continuum 
(see also Forster Schreiber et al. 2009). The locally 
calibrated scaling Ay.neb = Ay^sed/OAA holds out to 
z « 2.6. Since the SINS Ha measurements were obtained 
in integral field mode, our conclusion is not subject to 
uncertainties in slit correction factors. 

Next, we fixed the SFR in SED modeling to 
SFRjjv+iR^ avoiding biases inherent to dust correction 
methods and reducing the degrees of freedom by one. 
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The SFHs inferred from SED modeling are tested for 
consistency with a no-merger galaxy continuity equa- 
tion. Briefly, we compared the observed SFR-M relation 
and mass function at a range of redshifts to backtraced 
analogs based on evolving the observed galaxy popula- 
tion at lower redshifts back in time. We find the locus 
of the main sequence of star formation to be well re- 
produced by the backtraced model. However, number 
densities of intermediate- to low-mass galaxies are under- 
estimated. This discrepancy is observed both for simple 
r models and delayed r models. Accounting for mergers 
is unlikely to resolve the discrepancy without removing 
the agreement at the high-mass end. A better agreement 
is only obtained when forcing early formation redshifts. 

Independent evidence from resolved SED modeling 
also implies that stellar population modeling of inte- 
grated photometry leads to age underestimates. Re- 
solving regions that underwent different SFHs helps to 
reduce the effect of outshining of old stellar popula- 
tions by the youngest generation of stars. Alternatively, 
higher S/N data extending to shorter rest- wavelengths 
Xrest < 0.2/ini would provide more leverage on the stellar 
age. 

The authors acknowledge Mariska Kriek for use of the 
FAST stellar population fitting code. S. W. wishes to 
thank Roderik Overzier for stimulating discussions. The 
Millennium Simulation databases used in this paper and 
the web application providing online access to them were 
constructed as part of the activities of the German As- 
trophysical Virtual Observatory. 



APPENDIX 

One concern in exploiting 24 /im emission as SFR in- 
dicator for high-redshift galaxies is a possible contribu- 
tion from hot dust heated by an Active Galactic Nu- 
cleus (AGN) rather than young stars. In order to in- 
vestigate the potential bias induced by AGN in our sam- 
ple, we mark all galaxies associated with an X-ray source 
brighter than Lx > 10^^ erg in the rest-frame 2-10 
keV band with a black open circle in Figure [2j Here, the 
X-ray luminosities were calculated based on the galaxy's 
redshift, and the full band flux and spectral index from 



the Luo et al. (2008) 2 Ms Ghandra catalog, which we 
cross-correlated with the FIREWORKS catalog using a 
search radius of 1" . We note that, owing to the deeper 
PEP observations in GOODS-South, the fraction of X- 
ray AGN with a significant (> Scr) PACS detection is 
larger (- 35%) than the 20% found in GOODS-North 
by Shao et al. (2010). However, among the total sample 
of PACS-detected sources. X-ray AGN form a minority 
(< 10%). 

While the X-ray selected AGN typically have residu- 
als vf^/i^f^^tempiate < 1 in Figure Wp (for a given IR 
luminosity, they are brighter at 24 /xm than the over- 
all PACS-detected galaxy population by a median fac- 
tor 1.4), excluding them changes the median binned val- 
ues (filled black symbols in Figure [2Jd) by less than 8% 
only. We conclude that, whereas contribution by AGN 
can significantly affect the mid- to far-IR SEDs of indi- 
vidual galaxies, their relative number is sufficiently small 
that our above result for the ensemble of PAGS-detected 
galaxies out to z = 3 is robust against such a bias. We 
do however caution that this assessment of AGN contri- 
bution relies on the assumption that all AGN responsible 
for hot dust emission have Lx > erg s~^. If a sub- 
stantial population of Compton thick AGN, undetected 
at X-ray wavelengths, exists (Daddi et al. 2007b), other 
indicators of AGN activity such as a power-law shape of 
the SED in the rest-frame 1-10 ^m regime (see, e.g.. 
Park et al. 2010) have to be explored to properly in- 
fer SFRs from dust re-emission. Exploiting the combi- 
nation of PAGS and Ghandra data in GOODS-North, 
Shao et al. (2010) find that the far-infrared luminos- 
ity of high-redshift AGN shows little dependence on the 
X-ray obscuring column and the AGN luminosity. Fi- 
nally, we note that ultra-deep IRS spectroscopy of 2; ~ 1 
LIRGs and z ~ 2 ULIRGs (Fadda et al. 2010) found 
that both categories are starburst dominated. Based on 
the strong PAH emission in their IRS spectra, Fadda et 
al. (2010) conclude that the compton thick AGN con- 
tribution to the bolometric luminosity of 2: ^ 2 ULIRGs 
is substantially lower than previously inferred by Daddi 
et al. (2007bj. Nordon et al. (2011) also argue for an 
enhanced presence of PAHs in star-forming z ~ 2 galax- 
ies, rather than systematic boosting of 24 fj,m fluxes by 
AGN. 
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